add multimodal support for qwen2.5 #90

abdulazizab2 · 2025-05-06T14:50:42Z

Issue

Only multi-modal input supported in vllm backend is Llama 3.2

Contribution

Add support for qwen2.5 multi-modal input
Refactor code to to easily add other multi-modal input models

MrYang1916

The code has been tested to work properly and can perform normal inference requests on qwen2.5 multi-mode data.

MohammedAlkhrashi · 2025-07-07T09:39:12Z

Thanks, that worked for me

soulseen · 2025-07-11T07:25:07Z

@abdulazizab2 is this PR support for Qwen/Qwen2.5-VL-7B-Instruct ?

abdulazizab2 · 2025-07-11T17:14:12Z

@abdulazizab2 is this PR support for Qwen/Qwen2.5-VL-7B-Instruct ?

It should support all Qwen2.5-VL architectures. It worked for 7B specifically

soulseen · 2025-07-17T09:00:43Z

@abdulazizab2 is this PR support for Qwen/Qwen2.5-VL-7B-Instruct ?

It should support all Qwen2.5-VL architectures. It worked for 7B specifically

Is the Triton support request image URL for inference? And I always get an error as follows:

{"error":"Error generating stream: Invalid base64-encoded string: number of data characters (5) cannot be 1 more than a multiple of 4"}

My request data is like:

headers = {
    "Content-Type": "application/json"
}

prompt= "what is a usdot number"
img_url = "http:/xxxx.com/xx.jpg"

data = {
    "text_input": "你好",
    # "text_input": "Describe the content of this image.",
    "image": img_url,
    "parameters": {
        "stream": False,
        "max_tokens": 256,
        "temperature": 0.7
    }
}

abdulazizab2 · 2025-07-17T09:09:25Z

@soulseen

Try this sample request and refine it with your parameters

#!/bin/bash

# Define image URL and local path
image_url="https://upload.wikimedia.org/wikipedia/en/thumb/7/7d/Lenna_%28test_image%29.png/440px-Lenna_%28test_image%29.png"
image_path="lenna.png"

# Download the image
curl -s -o "$image_path" "$image_url"

# Base64 encode the image without newlines
image_base64=$(base64 -w 0 "$image_path")

# Create the JSON payload
payload_file=$(temp)
cat > "$payload_file" <<EOF
{
    "text_input": "Describe these images ?",
    "image": "$image_base64",
    "sampling_parameters": {
        "max_tokens": 256,
        "temperature": 0
    },
    "exclude_input_in_output": true
}
EOF

# Send the POST request
url="http://localhost:8000/v2/models/qwen2.5_vl_3b/generate"
response=$(curl -s -X POST "$url" -H "Content-Type: application/json" -d @"$payload_file")

# Clean up
rm "$payload_file"

# Output the response
echo "$response"

soulseen · 2025-07-17T09:57:52Z

@abdulazizab2 thank you for your share, but I also get an error like:

E0717 09:56:34.965292 80494 model.py:507] "[vllm] Error generating stream: type object 'c_python_backend_utils.Logger' has no attribute 'log_warning'"

triton version: tritonserver:25.06-py3

abdulazizab2 · 2025-07-17T10:05:06Z

@abdulazizab2 thank you for your share, but I also get an error like:
E0717 09:56:34.965292 80494 model.py:507] "[vllm] Error generating stream: type object 'c_python_backend_utils.Logger' has no attribute 'log_warning'"
triton version: tritonserver:25.06-py3

Can you check with the following versions:
triton version: tritonserver:25.01-py3
vllm version: 0.8.5

add multimodal support for qwen2.5

519f824

MrYang1916 approved these changes Jun 1, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add multimodal support for qwen2.5 #90

add multimodal support for qwen2.5 #90

Uh oh!

abdulazizab2 commented May 6, 2025

Uh oh!

MrYang1916 left a comment •

edited

Loading

Uh oh!

MohammedAlkhrashi commented Jul 7, 2025

Uh oh!

soulseen commented Jul 11, 2025

Uh oh!

abdulazizab2 commented Jul 11, 2025

Uh oh!

soulseen commented Jul 17, 2025

Uh oh!

abdulazizab2 commented Jul 17, 2025

Uh oh!

soulseen commented Jul 17, 2025

Uh oh!

abdulazizab2 commented Jul 17, 2025

Uh oh!

Uh oh!

add multimodal support for qwen2.5 #90

Are you sure you want to change the base?

add multimodal support for qwen2.5 #90

Uh oh!

Conversation

abdulazizab2 commented May 6, 2025

Issue

Contribution

Uh oh!

MrYang1916 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

MohammedAlkhrashi commented Jul 7, 2025

Uh oh!

soulseen commented Jul 11, 2025

Uh oh!

abdulazizab2 commented Jul 11, 2025

Uh oh!

soulseen commented Jul 17, 2025

Uh oh!

abdulazizab2 commented Jul 17, 2025

Uh oh!

soulseen commented Jul 17, 2025

Uh oh!

abdulazizab2 commented Jul 17, 2025

Uh oh!

Uh oh!

MrYang1916 left a comment •

edited

Loading